Exploring Vector Search at Scale

Stephen Batifol • Location: TUECHTIG • Back to Haystack EU 2024

Milvus is an open-source vector database built to power Gen AI solutions. 80% of the data in the world is unstructured data, and vector databases are the databases that help you get valuable insights from unstructured data. With this in mind, we built Milvus as a distributed system on top of other open-source solutions, including MinIO and Kafka, to support vector collections that exceed billion-scale. This session will explore the architecture decisions that make it possible to have Vector Search at Billion scale. We will talk about the different indexes that are needed, why being distributed is important and what are the tweaks that are needed to achieve such a scale. This talk will also have a live demo to showcase the capability of Vector Search at Scale.

Stephen Batifol

Milvus / Zilliz

Stephen Batifol is a Developer Advocate at Zilliz. He previously worked as a Machine Learning Engineer at Wolt, where he created and worked on the ML Platform, and previously as a Data Scientist at Brevo. Stephen studied Computer Science and Artificial Intelligence. He is a founding member of the MLOps.community Berlin group, where he organizes Meetups and hackathons. He enjoys boxing and surfing.